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Abstract 


The distribution of particles inside hadronic jets produced in the decay of boosted W and 
Z bosons can be used to discriminate such jets from the continuum background. Given 
that a jet has been identified as likely resulting from the hadronic decay of a boosted 
IT or Z boson, this paper presents a technique for further differentiating Z bosons from 
W bosons. The variables used are jet mass, jet charge, and a Z?-tagging discriminant. 
A likelihood tagger is constructed from these variables and tested in the simulation of 
W —> WZ for bosons in the transverse momentum range 200 GeV < < 400 GeV in 

^[s = % TeV pp collisions with the ATLAS detector at the LHC. For Z-boson tagging 
ethciencies of ez - 90%, 50%, and 10%, one can achieve W^-boson tagging rejection 
factors (l/eiy+l of 1.7, 8.3 and 1000, respectively. It is not possible to measure these 
efficiencies in the data due to the lack of a pure sample of high pj, hadronically decaying 
Z bosons. However, the modelling of the tagger inputs for boosted W bosons is studied 
in data using a ff-enriched sample of events in 20.3 fb”' of data at = 8 TeV. The 
inputs are well modelled within uncertainties, which builds confidence in the expected 
tagger performance. 
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1. Introduction 


Processes involving the production and decay of W and Z bosons provide benchmarks for testing the 
Standard Model (SM), as well as probes of physics beyond the SM (BSM). Since the cross section for 
the direct strong production of events with multiple jets (QCD multijets) at the Large Hadron Collider 
(LHC) is much larger than for W and Z boson production, it is usually the case that the leptonic decays 
of bosons must be used to reduce the overwhelming background. However, when the momentum 
of a boson V is comparable with its mass, my, the spatial proximity of the decay products provides a 
new set of tools that can be used to distinguish between jets from hadronic boson decays and jets ori¬ 
ginating from QCD multijet backgrounds. In particular, since the angle between the decay products of 
a boson V scales with Imylp^, for large p^,jet substructure techniques become powerful tools. This 
leads to a tradeoff between using relatively pure leptonic decays and high-branching-ratio hadronic 
decays. In some BSM theories, new particles similar to IT/Z bosons do not couple directly to leptons, 
so searching for hadronic decays of heavy particles is essential. 

Jet substructure techniques developed to distinguish hadronically decaying W and Z bosons from QCD 
multijet background processes have become increasingly sophisticated. A recent review is given in 
Ref. [1]. Both ATLAS [2] and CMS [3] have performed detailed comparisons of the various tagging 
variables and jet-grooming techniques with the overall conclusion that large QCD multijet suppression 
factors' are possible while maintaining acceptable levels of boson tagging efficiency. Given a IT/Z- 
boson tagger, a natural next step is to distinguish boson types. 

There are several important possible applications of a boson-type tagger at the LHC. First, a type 
tagger could enhance the SM physics program with W and Z bosons in the final sfafe. Measuremenfs 
of fhis kind include fhe deferminafion of fhe cross sections for F-i-jefs, VV, and tt + V. Anofher 
imporfanf use of a boson-fype fagger is in searches for flavour-changing neufral currenfs (FCNC). 
Due fo fhe Glashow-Iliopoulos-Maiani (GIM) mechanism [4], FCNC processes in fhe SM are highly 
suppressed. Many models of new physics predicf large enhancemenfs fo such processes. Bofh ATLAS 
and CMS have performed searches for FCNC [5, 6] of fhe form t —> Z <7 in fhe lepfonic channels, buf 
fhese could be exfended by ufilizing fhe hadronic Z decays as well. FCNC processes mediafed by a 
lepfophobic Z' such as f —> Z'q may be defecfed only via hadronic fype-fagging mefhods. A fhird 
use of a boson-fype fagger is fo cafegorize fhe properfies of new physics, if discovered af fhe LHC. 
For insfance, if a new boson were discovered as a hadronic resonance, a boson-fype fagger could 
pofenfially disfinguish a 1T'(—> qq) from a Z'(^ qq) (where mass alone may nol be useful). This 
is especially relevanf for lepfophobic new bosons, which could nol be disfinguished using lepfonic 
decays. 

Labelling jefs as originating from aW or Z boson is less ambiguous fhan quark/gluon labelling. A W 
boson can radiafe a Z boson, jusf like a quark can radiate a gluon, buf fhis is heavily suppressed for 
fhe former and nol for fhe latter. The radiation pattern of jefs from W- and Z-bosons is less lopology 
dependenl because if is largely independenl of fhe olher radialion in fhe evenl as W and Z bosons are 
colour singlels. Aside from fhe producfion cross secfion and sublie differences in differential decay 
dislribufions, fhe only fealures fhal disfinguish belween W and Z bosons are fheir mass, charge, and 
branching rafios. Experimenlally, fhis means lhaf fhe only variables fhal are useful in discriminating 
belween hadronic decays of W and Z bosons are Ihose which are sensilive fo fhese properfies. The 
Ihree variables used in fhe analysis presenfed here are jet mass, sensitive fo fhe boson mass, yet charge. 


* 0 ( 1 %) QCD multijet efficiency at 50% signal efficiency. 
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sensitive to the boson charge, and a b-tagging discriminant which is sensitive to the heavy-flavour 
decay branching fractions of the bosons. The application of a boson-type tagger in practice will be 
accompanied by the prior use of a boson tagger (to reject QCD multijet processes). The type-tagger 
variables are largely independent of typical boson-tagger discriminants like n-subjettiness [7], which 
rely on the two-prong hard structure of both the W and Z decays^. 

This paper introduces a jet tagging method to distinguish between hadronically decay W and Z bosons 
at the LHC, and documents its performance with the ATLAS detector at = 8 TeV. The paper is 
organized as follows. Section 2 describes the simulated datasets used in constructing and evaluating 
the boson-type tagger. Following a discussion of the differences between the properties of W and Z 
bosons in Sect. 3, Sect. 4 defines the three discriminating variables. The construction and performance 
of the tagger are detailed in Sect. 5 and the sensitivity to systematic uncertainties is described in 
Sect. 6. The input variables are studied in a dataset enriched in boosted W bosons in Sect. 7. The 
paper ends with a discussion of possible uses of the tagger in Sect. 8 and conclusions in Sect. 9. 


2. Datasets 

Two sets of Monte Carlo (MC) simulations are generated, one to study the tagger’s W versus Z per¬ 
formance and the other to compare the tagger inputs for W bosons with the data. Simulations of 
hypothetical W —> fTZ production and decay provide a copious source of boosted W and Z bosons 
whose pt scale is set by the mass of the W boson. Such events are used to construct a tagger to 
separate hadronically decaying boosted W and Z bosons, as well as to evaluate its performance. It 
is not possible to measure the performance directly in the data due to the lack of a pure sample of 
boosted, hadronically decaying Z bosons, but the modelling of the tagger inputs can be studied using 
hadronically decaying W bosons from tt events in the data. 

A simulated sample of W bosons is generated with PYTHIA 8.160 [8] using the leading-order parton 
distribution function set (PDF) MSTW2Q08 [9, 10] and the AU2 [11] set of tunable parameters (tune) for 
the underlying event. The baseline samples use PYTHIA for the 2^2 matrix element calculation, as 
well as pT-ordered parton showers [12] and the Lund string model [13] for hadronization. Additional 
samples are produced with HERWIG-h- [14], which uses angular ordering of the parton showers [15], 
a cluster model for hadronization [16], as well as the EE 3 [17] underlying-event tune. The W differs 
from the SM W boson only in its mass and the branching ratio W ITZ is set to 100%. The W and Z 
bosons are produced with a mixture of polarizations, but the longitudinal polarization state dominates 
because niw, mz my/’. In order to remove artifacts in the px distributions of the W and Z bosons due 
to the generation of W particles with discrete masses, the p^ spectra are re-weighted to be uniform 
in the range 200 GeV< p^ < 400 GeV. As is discussed in Sect. 1, for px > 200 GeV, a jet with large 
radius is expected to capture most of the IT or Z boson decay products. The range is truncated to 
Px < 400 GeV because hadronically decaying W bosons can be probed with data in this px range; 
there are too few events in the 8 TeV dataset for px > 400 GeV. 

Top-quark pair production is simulated using the next-to-leading-order (NLO) generator POWHEG-BOX 
[18-20] with the NLO PDF set CTl® [10] and parton showering from PYTHIA 6 [21]. The single¬ 
top ( 5 -, t-, and ITt-channel) backgrounds are modelled with POWHEG-BOX and PYTHIA 6, as for the 
nominal tt simulation. The PDF set CT10f4 [9] is used for the t-channel and CTl® is used for the 


^ See Sect. A for details. 
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5- and Wf-channels. For the VFt-channel, the ‘inclusive Diagram Removal’ (DR) scheme is used 
for overlap with tt [22]. The VF+jets and Z+jets backgrounds are modelled with ALPGEN 2.1.4 [23], 
PYTHIA 6 and the CTEQ6L1 PDF set [24]. Dibosons are generated with HERWIG 6.52Q. 2 [25] using 
the CTEQ6L1 PDF set and the AUET2 tune [26]. Version 6.426 is used everywhere for PYTHIA 6, 
with the Perugia2QllC tune [27]. 

Events are processed with a full simulation of the ATEAS detector and trigger [28] based on the 
Geant4 [29] toolkit, and reconstructed using the same software as for the experimental data. The 
average number of additional pp collisions per bunch crossing (pileup interactions) was 20.7 over the 
full 2012 run. The effects of pileup are modelled by adding multiple minimum-bias events, which are 
simulated with PYTHIA 8.16®, to the generated hard-scatter events. The distribution of the number of 
interactions is then weighted to reflect the pileup distribution in the 2012 data. A sample of W bosons 
is selected from data taken in 2012 at centre-of-mass energy of ^[s = 8TeV from tt candidates as 
described in Sect. 7. 


3. Distinguishing a Z boson from a W boson 

Decays of IT or Z bosons are characterized by the boson’s mass and coupling to fermions. The 
mass difference between the W and Z boson is about 10 GeV and if produced from a hard scatter 
or the decay of a heavy enough resonance, both bosons are produced nearly on-shell since the width 
Fv = 2.1 (2.5) GeV is much less than the mass my - 80.4 (91.2) GeV for W (Z) bosons [30]. The 
Breit-Wigner resonance curves for W and Z bosons are shown in Fig. 1(a). The separation between 
the curves is a theoretical limit on how well mass-sensitive variables can distinguish between W and 
Z bosons. For hadronic boson decays, the mass peaks measured with jets are broader. This is because 
the jet-clustering algorithm for final-state hadrons loses particles at large angles to the jet axis and 
includes extra particles from the underlying event and pileup. 

The generic coupling of a boson V to fermions is given by 9v7^[cv - caTj], where gy is a boson- 
dependent overall coupling strength, and cy and ca are the vector and axial-vector couplings, respect¬ 
ively. The W boson couples only to left-handed fermions so cv - ca = 1 with gy/ oc kVcGpm^l V 17 P, 
where Gp is the Fermi coupling constant, Vij is a Cabibbo-Kobayashi-Maskawa (CKM) matrix ele¬ 
ment [31, 32], k represents higher-order corrections, and Ac = 3 for the three colours of quarks and 
Nc = 1 for leptons. The CKM matrix is nearly diagonal so ud and VF'*' ^ cs are the dominant 

decay modes. Small off-diagonal elements contribute to the other possible decay modes, and the over¬ 
all hadronic branching ratios are approximately 50% for IF —> cX and 50% for IF ^ light-quark pairs. 
The VF boson has electric charge ± 1 in units of the electron charge, so by conservation of charge, its 
decay products have the same net charge. The scalar sum of the charge of all the final-state hadrons 
originating from a IF boson decay is not infrared safe (directly sensitive to the non-zero detection 
threshold), so there are limits to the performance of charge tagging dictated by the energy threshold 
placed on charged particles in the event reconstruction. 

In contrast to VF boson decays, Z bosons decay to both the left- and right-handed fermions. The partial 
width for Z ^ // is proportional to kNcGym^^{Cy + c^]. The factors cy and c\ are slightly different 
for up- and down-type fermions. The bb branching ratio is 22%, the cc branching ratio is 17% and 
the sum of the remaining branching ratios is 61%. VF boson decays to ^-quarks are highly suppressed 
by the small CKM matrix elements Vcb and Vub, so that identifying Z>-hadron decays associated with 
a hadronically decaying boson is a powerful discriminating tool. Branching ratios are plotted in 
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Fig. 1(d) for Z decays to light quarks, c-quarks, and b- quarks, and in Fig. 1(c) for the W boson decays 
to light quarks and c-quarks. 

Since the coupling structure is not identical for W and Z bosons, the total decay rates differ, and 
the angular distributions of the decay products also differ slighfly. However, even af parfon level 
withouf any combinaforic noise, fhe differences in fhe angular disfribufions are subfle. There is no 
difference for fhe fwo bosons wifh longifudinal polarizafion because fhe distribufions for right- and 
left-handed fermions are the same. The distributions are different for right- and left-handed fermions 
for transversely polarized W and Z bosons, as shown in Fig. 1(b). The relative contribution of left- 
and right-handed components for the Z decays depends on the quark flavour; for up-fype quarks fhe 
relafive confribufion from righf-handed fermions is 15% while if is only 3% for down-fype quarks. In 
tt decays, fhe fracfion of longitudinally polarized W bosons (ignoring the f^-quark mass) is l{m^ + 
2m^) ~ 0.7. In contrast, the boson is mostly transversely polarized in inclusive F-i-jets events. Any 
discrimination shown in Fig. 1(b) is diluted by the longitudinal polarization, combinatorics, non- 
perturbative effects, and detector reconstruction, so angular distributions are not considered further in 
this paper^. 


4. Definitions of reconstructed objects 

ATLAS is a multi-purpose particle detector [33] with nearly coverage in solid angle.^ The energy 
of the hadronic decay products of boosted bosons is measured by a system of calorimeters. The 
electromagnetic calorimeter consists of a Pb/liquid-argon sampling calorimeter split into barrel (\r]\ 
< 1.5) and endcap ( 1.5 < |? 7 l < 3.2) sections. The hadronic calorimetry is provided by a barrel 
steel/scintillating-tile calorimeter {\r]\ < 1.7) and two endcap Cu/liquid-argon sections (1.5 < I 77 I < 
3.2). Finally, the forward region (3.1 < \t]\ < 4.9) is covered by a liquid-argon calorimeter with Cu (W) 
absorber in the electromagnetic (hadronic) section. Energy depositions are grouped into topological 
calorimeter-cell clusters [34] and then calibrated using the local cluster weighting algorithm [35, 36]. 
Jets are formed from clusters using two different jet algorithms. Small-radius jets are built with the 
anti-kf algorithm with jet radius parameter R = 0.4 [37]. Large-radius jets are formed using the anti- 
kt algorithm with = 1.0 and then trimmed [38] by re-clustering the jet constituents with the kt 
algorithm using R = 0.3 and removing the constituents with pj less than 5% of the original jet pT- 
Both the small- and large-radius jets are further calibrated to account for the residual detector response 
effects. For small-radius jets, this is a pt- and p-dependent energy calibration, plus a correction to 
mitigate the contribution from additional pp collisions and to suppress jets from these additional 
collisions [39]. In addition to pj- and p-dependent energy corrections, large-radius jets J have a 
calibrated jet mass'. 


^ The impact of polarization on distinguishing boosted W boson jets from QCD multijets has been studied in Ref. [3]. 
There are small differences in performance between transversely and longitudinally polarized bosons, but any differences 
are less relevant for W versus Z tagging where the angular distributions are identical for longitudinally polarized bosons 
and only slightly differ for transversely polarized bosons. 

* ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the 
detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis 
points upward. Polar coordinates (r, 0) are used in the transverse plane, (j) being the azimuthal angle around the beam 
pipe. The pseudorapidity is defined in terms of the polar angle 0 as 77 = -In tan(6/2). Transverse momentum and energy 
are defined in the x-y plane as pj = p ■ sin( 6 ) and Ej = E ■ sin( 6 ). 
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Figure 1; (a) Breit-Wigner resonances for the W (red) and Z (blue) bosons, (b) angular distribution of the decay 
products of transversely polarized W/Z bosons with respect to the spin direction in the boson rest frame, (c) 
hadronic branching fractions of the boson, and (d) of the Z boson. In (c) and (d), light stands for decay 
modes not involving c and b quarks. 


'' jsj ' '' j^J ’ 

where Ej is the energy of cluster j and pj is a vector with magnitude Ej and direction rjj). The 
jet mass calibration depends on the calibrated jet energy and on the jet rj [45]. When a W or Z boson 
is produced with large enough momentum, its decay products are collimated. When 2mylp^ ~ 1, an 
= 1.0 trimmed jet captures a large fraction of the decay products and the jet-mass scale is set by 
my. Since the W and Z boson masses differ by about 10 GeV, the jet mass can be used to discriminate 
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between these two particles. The distributions of the boson masses and jet masses for hadronically 
decaying W and Z bosons are shown in Fig. 2. The particle-level (‘truth’) jet mass is constructed 
from stable particles in the MC simulation (cr > 10 mm), excluding neutrinos and muons, clustered 
with the same jet algorithm as for calorimeter-cell clusters. The QCD processes that govern the 
formation of stable particles from the W and Z decay products create a broad distribution of jet masses 
even without taking into account detector resolution. Constructing the jet mass from calorimeter¬ 
cell clusters further broadens the distribution. The jet-mass resolution (physical 0 detector) is large 
compared to the natural width of the W and Z bosons and comparable to the difference in their masses. 
For example, the standard deviation of the detector resolution is approximately 

10%. The jet-mass variable nevertheless has some discriminating power. 



Boson Mass [GeV] Truth Jet Mass [GeV] Reconstructed Jet Mass [GeV] 


(a) 


(b) 


(c) 


Figure 2: (a) The boson mass at generator level, (b) ‘truth’ jet mass (at particle level) after parton fragmentation, 
and (c) reconstructed jet mass distributions. The left plot has a different vertical scale than the right two plots 
and also has no pj requirement. 


The momentum and electric charge of particles traversing the detector contain information about the 
charge of their parent boson. The tracks of charged particles are measured in a 2 T axial field generated 
by a solenoid magnet which surrounds the inner detector (ID) consisting of silicon pixels, silicon 
micro-strips, and a transition radiation tracking detector. Charged-particle tracks are reconstructed 
from all three ID technologies with a full coverage in cf), It/I < 2.5 and pj > 400 MeV. The charge q 
of a track is determined as part of the reconstruction procedure, which uses a fit with five parameters: 
the transverse and longitudinal impact parameters, 0, 9, and qjp, where p is the track momentum. 
To suppress the impact of pileup, tracks are required to originate from the primary collision vertex, 
which is defined as the vertex with the largest Yj p\ computed from associated tracks. Additionally, 
tracks must satisfy a very loose quality criterion for the track fit pcr degree of freedom, which 
must be less than three. Tracks are associated with jets using ghost association [40]. The charge of 
tracks associated with a jet is sensitive to the charge of the initiating parton. In order to minimize the 
fluctuations due to low-px particles, the. jet charge is calculated using a px-weighting scheme [41]: 


Qj = 


1 

(PtjY 


(sTracks 


( 2 ) 


where Tracks is the set of tracks with px > 500 MeV associated with jet J, qi is the charge (in units of 
the electron charge) determined from the curvature of track i with associated pl^, a- is a free parameter. 
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and pjj is the transverse momentum of the jet measured in the calorimeter. The calorimeter energy 
is used in the denominator to determine instead of the sum of track momenta to account for the 
contribution from neutral particles. Dedicated studies have shown that k - 0.5 is generally best for 
determining the charge of partons from the jets they produce [42]. The distributions of the jet charge 
for jets initiated by IT'*', W~ and Z bosons are shown in Fig. 3. There is an observable separation 
between positive and negative W bosons. The expected charge composition of a IT sample is process 
dependent. For example, there are more IT''' than 1T“ bosons in inclusive IT' production because of the 
initial charge asymmetry of quarks in the proton resulting in more W'Z) than 1T'~(—> 1T“Z). 

The discrimination between Z bosons and a near even mixture of IT- is greatly diminished with 
respect to e.g. Z versus IT'*'. In that case charge sensitive variables are not very useful for the tagger 
and so all results are shown also without such variables. In a variety of physics processes, the charge 
of the hadronically decaying IT boson is known from other information in the event. For example, 
in searches for FCNC effects in tt events with one leptonically decaying IT boson, the charge of the 
lepton is opposite to the charge of the hadronically decaying IT boson. Henceforth, only IT'*' bosons 
are used for constructing the boson-type tagger; the results are the same for 1T“ bosons. 
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Figure 3: The jet charge distribution for jets originating from IT* and Z bosons in simulated IT' decays. Each 
distribution is normalized to unity. The parameter k controls the px-weighting of the tracks in the jet charge 
sum. 


The tracks from charged particles can be used further to identify the decays of certain heavy-flavour 
quarks inside jets due to the long Z^-hadron lifetime. This is useful for boson-type tagging because 
the Z boson couples to bb while decays of the IT boson to Z^-quarks are highly suppressed and can 
be neglected. ATLAS has commissioned a ^-tagging algorithm called MVl (defined in Refs. [43] 
and [44]) which combines information about track impact-parameter significance with the explicit 
reconstruction of displaced b- and c-hadron decay vertices. The boson-type tagger presented here 








uses multiple bins of the MV 1 distribution simultaneously. Five bins of MV 1 are defined by fj-tag 
efficiencies (probability to tag a f^-quark jet as such) of 0%-50%, 50%-60%, 60%-70%, 70%-80%, 
and 80%-100% as determined in simulated tt events. A lower f7-tag efficiency leads to higher light- 
quark jet rejection. The five fj-tagging efficiency bins are exclusive and MV 1 is constructed as a 
likelihood with values mostly between zero and one (one means more like a f^-jet). For example, a 
100% f?-tagging efficiency corresponds to a threshold of MVl > 0 and an 80% f?-tagging efficiency 
corresponds to a threshold value of MVl > z for z 1. The 80%-100% fj-tag efficiency bin then 
corresponds to jets with an MV 1 value between 0 and z. Constructed in this way, the fraction of true 
fj-jets inside an efficiency bin x%-y% should be {y - x)%. 

Small-radius jets are matched to a large-radius jet by geometric matching^ (AR < 1.0). Of all such 
small-radius jets, the two leading ones are considered. There are thus 30 possible bins of combined 
MV 1 when considering the leading and sub-leading matched small-radius jet. The number of bins is 
25 from the 5x5 efficiency-binned MV 1 distributions in addition to five more for the case in which 
there is no second small-radius jet matched to the large-radius jet. The distribution for the efficiency- 
binned MVl variable for the leading and sub-leading matched small-radius jets is shown for W and Z 
bosons in Fig. 4. The flavour of a small-radius jet is defined as the type of the highest energy parton 
from the parton shower record within AR < 0.4. As expected, a clear factorization is seen in Fig. 4 - 
the MV 1 value depends on the flavour of the small-radius jet and not the process that created it. This 
means that c-jets from W decays have the same MV 1 distribution as c-jets from Z decays; the same 
is true for light jets. Small-radius jets originating from f^-hadron decays tend to have a larger value 
of MVl, which means they fall in a lower efficiency bin. Small-radius jefs not originating from b- or 
c-decays are called light jets and are strongly peaked in the most efficient bin of MVl. There is always 
one small-radius jet matched to the large-radius jet, but about 20% of the time there is no sub-leading 
small-radius jet with pj > 25 GeV matched to the large-radius jets. These cases are all predicted to 
originate from light-quark decays of the W and Z bosons. 


5. Tagger performance 


The optimal multivariate tagger combining jet mass, jet charge, and the MV 1 of matched small-radius 
jets is constructed from a three-dimensional (3D) likelihood ratio. For N bins each of jet mass and 
jet charge, as well as 30 combined MV 1 bins, the 3D likelihood ratio would have 30 x total bins. 
Populating all of these bins with sufficient MC events to produce templates for the likelihood ratio 
requires an unreasonable amount of computing resources, especially for the high-efficiency bins of 
combined MVl. Estimating the 3D likelihood as the product of the ID marginal distributions, where 
all variables but the one under consideration are integrated out, is a poor approximation for jet mass 
and combined MV 1 due to the correlation induced by the presence of semileptonic f^-decays, which 
shift the jet mass to lower values due to the presence of unmeasured neutrinos.^ It is still possible to 


^ In the definition of jets, R is the characteristic size in (y, (p) and the rapidity y is used in the jet clustering procedure, 
whereas geometrical matching between reconstructed objects is performed using (AR)^ = + {Arff-, where y is the 

pseudorapidity. 

® The muons from semileptonic decays are added back to the jet using a four-momentum sum. Muons are measured by 
the combination of a dedicated muon spectrometer with its own toroidal magnetic field outside the calorimeters, and the 
inner detector. Adding back the muon has a negligible impact on the inclusive mass distribution due to the semileptonic 
branching ratios and lepton identification requirements. For details about the muon reconstruction and selection, see 
Sect. 7 (the only difference here is that the isolation is not applied). 
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Figure 4; The efficiency-binned MV 1 distribution for small-radius jets associated with large-radius jets resulting 
from W and Z boson decays. The left (right) plot shows the leading (sub-leading) small-radius jet MV 1 distri¬ 
bution. The bins correspond to exclusive regions of fe-jet efficiency. As such, the bin content of the black line 
(/^-tagging for fe-jets) should be proportional to the size of the efficiency window; about 50% for the rightmost 
bin, 10% for the three middle bins and 20% for the second bin. 


use a simple product by noting that all three tagger inputs are independent when the flavour of the 
decaying boson has been determined. Thus, for each possible boson decay channel, templates are 
built for the jet mass, the jet charge, and the efficiency-binned MVl distributions. For a particular 
decay flavour, the joint distribution is then the product of the individual distributions. Summing over 
all hadronic decay channels then gives the full distribution. To ease notation, the efficiency-binned 
MVl is denoted B = (Biead> ^sub-lead)- The distribution for Biead (^sub-lead) is shown in the left (right) 
plot in Fig. 4. Symbolically, for decay flavour channel !F, mass M, charge Q, and efficiency-binned 
MV 1 B, the likelihood is given by: 


p{M, Q, B\v) = 2] Mr\v)p{M\r, v)p(Q\r, v) pmr, v), o) 

r 

where^ V € {W,Z} and the sum is over 'F = bb, cc, cs, cd and light-quark pairs. The distribution of B 
is well approximated as the product of the distributions for Biead and Bsub-iead when the flavours of the 
leading and sub-leading jets are known. This is exploited for hadronically decaying W bosons and for 
the light-quark flavour decays of Z bosons to construct templates for B that have a sufficient number 
of simulated events for large values of B, i.e. Px{B\F,V) = PrlBieadl?^, V)Pr(Bsub-iead|!T', V). The 
unit-normalized templates for B are shown in Fig. 4 and the unit-normalized templates V) and 

p{Q\F, V) are shown in Fig. 5. For a given boson type, the jet-charge template is nearly independent 
of the flavour. However, there is a dependence of the jet mass on the (heavy) flavour of the boson 
decay products. 

The likelihood function is constructed by taking the ratio of the probability distribution functions 
p{M,Q,B\V), for V e {W,Z}, determined from the templates in Eq. (3). Every bin i of the 3D 

’ The symbol p denotes a probability density whereas Pr denotes a discrete probability distribution. 
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(a) (b) 

Figure 5; (a) The jet mass p(M\'F, V) and (b) jet charge piQlT, V) templates conditioned on the flavour 'F of 
the boson V decay for jets with 200 GeV < pj < 400 GeV. The solid lines are for Z boson decays and the 
dashed lines are for W boson decays. 


histogram that approximates p{M, Q, B\V) is assigned a pair of numbers (/, Si/bi) where Si is the overall 
fraction of the signal (Z or W) in bin i and bi is the fraction of the overall background (the other boson 
flavour) in bin i. Bins are then sorted from largest to smallest stlbi, with /(/) defining a map from the 
old bin index to the new, sorted one. There are then two ID histograms: for the signal, bin j has bin 
content and for the background, bin j has bin content bf-i^y The optimal tagging procedure is 
then to set a threshold on the new ID histograms. The full likelihood ratio of the combined tagger is 
shown in Fig. 6 where the thresholds required for 90%, 50%, and 10% Z-boson tagging efficiency are 
marked with shaded regions. 

Curves displaying the tagging performance for all possible subsets of {M, Q,B} are shown in Fig. 7. 
There are 30 possible values for B, which are therefore represented by discrete points. The jet mass is 
the best performing single variable for medium to high Z-boson efficiencies, with visible improvement 
for M+B and M+Q. There is a significant gain from combining all three variables for Z-boson tagging 
efficiency above about 20%. Below 20%, the combined tagger is dominated by B where the Z ^ bb 
branching fraction no longer limits Z-boson tagging efficiency. For Z-boson efficiencies of about 
50%, one can achieve rejection factors (l/6vv+) of 3.3 by using Q ox B alone and about 5.0 
using mass alone. For Z efficiencies of ez = 90%, 50%, and 10%, rejection factors of 1.7, 8.3, 
and 1000, respectively, can be achieved with the combined tagger. Although most applications of 
boson-type tagging will target Z bosons as the signal while rejecting W bosons as background, the 
likelihood constructed in Fig. 6 can also be used to optimally distinguish IF'*' bosons from Z bosons. 
The corresponding performance curves are shown in Fig. 8. The locations of the fi-tagging points are 
all now shifted to high efficiency with respect to Fig. 7 because, for tagging, one wants to operate 
in the high-efficiency b-t&gging bins (whereas the opposite is optimal for Z tagging). At an efficiency 
of ew+ - 50%, a Z-boson rejection factor of 1/ez » 6.7 can be achieved. 
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Full Likelihood Ratio (Jet Mass+Jet Charge+b-tagging) 


Figure 6: The full likelihood ratio for the tagger formed from jet mass, jet charge, and a small-radius jet b- 
tagging discriminant. The black histogram shows the likelihood ratio for Z bosons and the red histogram is the 
likelihood ratio for bosons. The shaded areas show the region of the likelihood ratio corresponding to 90%, 
50%, and 10% working points of the Z-boson tagging efficiency. 


6. Systematic uncertainties 

The performance curves in Fig. 7 are based on the nominal modelling parameters of the ATLAS 
simulation. Additional studies show how the curves change due to the systematic uncertainties on the 
inputs to the likelihood function. Sources of experimental uncertainty include the calibrations of the 
large- and small-radius jet four-momenta, the Z^-tagging (which incorporates e.g. impact parameter 
modelling), and the modelling of track reconstruction. 

The uncertainty on the scale of the large-radius jet mass calibration is estimated using the double 
ratio in data and MC simulation of calorimeter jet mass to track jet mass [45]. Tracks associated 
with a jet are well measured and provide an independent observable correlated with the jet energy. 
Uncertainties on the jet-mass resolution can have a non-negligible impact on the performance of the 
tagger. The jet-mass resolution uncertainty is determined from the difference in the widths of the 
boosted W boson jet-mass peak in semileptonic tt simulated and measured data events [45] and also 
from varying the simulation according to its systematic uncertainties [46]. The resolution is about 
5 GeV in the Gaussian core of the mass spectrum and its uncertainty is about 20%. The impact of the 
jet-mass scale and resolution uncertainties on the boson-type tagger built using only the jet mass is 
shown in Fig. 9 for two nominal working points of 50% and 90% Z-boson tagging efficiency. Both 
the likelihood map / from Sect. 5 and the threshold value are fixed. Inputs to the tagger are shifted 
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(a) (b) 

Figure 7; The tradeoff between Z efficiency and (a) 1- (W^ efficiency) (b) or 1/{W^ efbciency) on (a) a linear 
scale and (b) a logarithmic scale. Each curve is constructed by placing thresholds on the likelihood constructed 
from the inputs indicated in the legend. Since the h-tagging discriminant is binned in efficiency, there are only 
discrete operating points for the tagger built only from B. 




(a) (b) 

Figure 8: The tradeoff between efficiency and (a) 1- (Z efficiency) or (b) 1/(Z efbciency) on (a) a linear 
scale and (b) a logarithmic scale. Each curve is constructed by placing thresholds on the likelihood constructed 
from the inputs indicated in the legend. Since the h-tagging discriminant is binned in efbciency, there are only 
discrete operating points for the tagger built only from B. 
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by their uncertainties and the ID histograms described above are re-populated. The efficiencies for 
W and Z bosons are recomputed and shown as markers in Fig. 9(a). Coherent shifts of the jet masses 
(JMS) for W and Z bosons result in movement along the nominal performance curve corresponding to 
±10% changes in the efficiency. However, there are also shifts away from the nominal curve because 
the optimal jet-mass cut is not a simple threshold. Variation of the jet-mass resolution (JMR) preserves 
the scale and so the movement is nearly perpendicular to the original performance curve, at the <5% 
level, because of the increased overlap in the Z and W mass distributions.^ Shifts along the nominal 
curve optimally use the input variables (albeit at different efficiencies), while shifts away from the 
nominal curve are a degradation in the performance. The impact of the fragmentation is estimated by 
using input variables from HERWIG but with the likelihood map from PYTHIA. PYTHIA and HERWIG 
have similar VT/Z efficiencies at both the 50% and 90% benchmark points. 

The systematic uncertainty on the efficiency of the tracking reconstruction is estimated by removing 
tracks associated with jets using an 77 -dependent probability [47]. The probability in the region 2.3 < 
I 77 I < 2.5 is 7%; it is 4% for 1.9 < I 77 I < 2.3, 3% for 1.3 < I 77 I < 1.9, and 2% for 0 < I 77 I < 1.3. These 
probabilities are known to be conservative in the most central rj bins. There is also an uncertainty on 
the modelling of track merging for high-pT jets, but the loss is expected to be negligible for jets with 
Pt < 400 GeV. Differences in the modelling of fragmentation can affect the expected performance 
for all the input variables, especially for the track-dependent observables. The impact of various 
uncertainties on the boson-type tagger built using only the jet charge is shown in Fig. 9(b). Since W 
and Z boson decays produce on average many tracks (see Sect. 7), removing a small number of them 
does not have a big impact on the jet-charge tagger as a result of the px-weighting in the jet charge 
sum. 

The efficiency to ( 7 -tag jets of various flavours {b, c, and light) is measured in data using tt events [43], 
jets with identified charm hadrons, and multijet events [44]. The differences between data and MC 
simulation are typically a few percent and are applied as independent correction factors on a per-jet 
basis. The uncertainties on these scale factor measurements are used as estimates of the systematic 
uncertainty on the ( 7 -tagging. The sources of uncertainty are decomposed into many uncorrelated 
components (24 for ( 7 -jets, 16 for c-jets, and 48 for light-flavour jets) and the impact on the rejection 
is added in quadrature for a fixed value of ^signal- The ( 7 -tagging of matched small-radius jets is also 
affected by uncertainties on the jet-energy scale and resolution. These quantities are varied within 
their uncertainties and if the shifted jet has px < 25 GeV, its MV 1 value is not considered. The impact 
of various uncertainties on the boson-type tagger built using only the ( 7 -tagging discriminant for a 10 % 
nominal Z efficiency is shown in Fig. 10. At this efficiency, the full boson-type tagger is dominated 
by the ( 7 -tagging inputs, as seen in Fig. 7. The scale factor uncertainty for ( 7 -jets has no impact on 
the W efficiency (no real ( 7 -jets), but there is approximately a 10% uncertainty on the Z efficiency. 
The uncertainties on the jet-energy scale for small-radius jets are relevant only because of the 25 GeV 
Px threshold. Since all of the large-radius jets are required to have px > 200 GeV, the threshold is 
relevant only in the rare case that one of the W daughters is nearly anti-parallel in the W rest frame to 
the direction of the W boost vector. 

The impact of the uncertainties on the jet-mass scale and resolution on the boson-type tagger built 
using all of the inputs (jet mass, jet charge, and ( 7 -tagging) is shown in Fig. 11(a). At very low Z- 
boson tagging efficiency, the tagger is dominated by ( 7 -tagging, so Fig. 10 is a good representation of 
the uncertainty on the full tagger’s performance. For higher efficiencies, the tagger is dominated by the 

* Although such shifts retain optimal use of the tagger (highest rejection for a fixed efficiency), they can degrade the quality 
of e.g. a cross-section measurement. 
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jet mass, although the jet charge and ^-tagging discriminant significantly improve the performance. 
The uncertainty on the full tagger’s performance at the 50% and 90% Z-boson tagging efficiency 
benchmark points is due mostly to the uncertainty on the jet mass, which is why these uncertainties 
are shown in Fig. 11. 


>; 1 

' ' ' 1 ' ' ' ' 1 ' ' ' ' 1 ' ' ' 

^ 0.5 

' ' ' ' N ' ' ' ' 1. 

O 

c 

ATLAS Simuiation 

o 

c 

ATLAS Sjmuiation 

CD 

'o 

Ws = 8 TeV, PYTHIA w ^ wz 

CD 

'o 

-is = 8T eVy PYTHIA w ^ wz 

iS 0.9 

jJet Mass Tagger 50% Benchmark J 

H— 

aj 0.4 

\Jet Mass Tagger 90% Benchmark J 

+ 

5 

1 


+ 

5 

1 

▼ \ 

0.8 

_ '•s _ 

0.3 





■■tv 


- Nominal 


-- Nominal 


• Benchmark 


• Benchmark 

0.7 

_T JMS Down * _ 

0.2 

_ T JMS Down _ 


- i JMS Up - 


- i JMS Up 


■ JMR (20 percent) 


■ JMR (20 percent) 


- + HERWIG - 


- -I- HERWIG 

r\ c 



1 1 1 1 1 1 1 1 1 1 1 1 i\ 1 1 1 1 1 


1.3 


0.4 


0.5 


0.6 0.7 

Z Efficiency 


0.8 


0.9 


1 1.1 
Z Efficiency 


>, 

o 

c 

0 ) 

o 


0.8 


LU 0.75 


0.7 


0.65 


O.f 


(a) 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1 1 1 1 1 \ 1 1 1 1 1 1 1 1 1 1 1 

ATLAS Simuiation 

O 

S 0.24 

ATLAS Simuiation 

r )[s = 8 TeV, PYTHIA w ^ wz 

'o 

(s = 8 TeV, pVthia w ^ wz 

J^.Charge Tagger 50% Benchmark J 

H— 

M— 

LU 

Jet Charge Tagger 90% Benchmark 

; ; 

^ 0.22 

1 

7 \ 7 


^ 0.2 

■ 




■-Nominal 


■-Nominal 

- o Benchmark 

0.18 

— o Benchmark \ — 

A Track Efficiency (r|) 


A Track Efficiency (ti) \ 

■ □ JER (20 percent) 


■ □ JER (20 percent) '' 

- 

0.16 

- \ — 

. HERWIG 


. * HERWIG 

1 i i 1 1 i i i 1 1 i 1 \ i i 1 i i i i 


.4 


0.45 


0.5 


0.55 0.6 

Z Efficiency 


0.8 


0.85 


0.9 


0.95 1 

Z Efficiency 


(b) 

Figure 9: The impact of selected systematic uncertainties on benchmark working points of the boson-type 
tagger, (a) a jet-mass-only tagger, for 50% (left) and 90% Z efficiency benchmarks, (b) a jet-charge-only 
tagger, for 50% (left) and 90% Z efficiency benchmarks. The point marked HERWIG uses the alternative shower 
and hadronization model for the simulation, with the likelihood template from PYTHIA. See the text for an 
explanation of the notation in the legend. 
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Figure 10; The impact of selected systematic uncertainties on benchmark working points of a h-tagging-only 
tagger at a 10% Z efficiency benchmark. The h-tagging discriminant is binned, so there are only discrete oper¬ 
ating points. The point marked HERWIG uses the alternative shower and hadronization model for the simulation, 
with the likelihood template from PYTHIA. The h-tagging scale factor uncertainties are determined separately 
for b-, C-, and light-quark jets. Variations are added in quadrature for each ‘truth’ jet flavour. There is no contri¬ 
bution from the h-jet scale factor uncertainties on the W rejection because there are no ‘truth’ h-jets. Conversely, 
the c- and light-jet scale factor uncertainties do not impact the Z bosons because at this low efficiency, all the 
selected Z bosons decay into bb. 


7. Validation of tagging variables using data 

The tagger cannot be fully tested with data because it is not possible to isolate a pure sample of 
hadronically decaying Z bosons in pp collisions. However, the modelling of the variables used to 
design the tagger can be studied with a relatively pure and copious sample of hadronically decaying 
W bosons in tt events which can be tagged by the leptonic decay of the other W boson in the event 
(semileptonic tt events). Single-lepton triggers are used to reject most of the events from QCD multijet 
background processes. Candidate reconstructed ti events are chosen by requiring an electron or a 
muon with pj > 25 GeV and \p\ < 2.5, as well as a missing transverse momentum > 20 GeV. 
The electrons and muons are required to satisfy a series of quality criteria, including isolation.^ Events 
are rejected if there is not exactly one electron or muon. In addition, the sum of the and the 
transverse mass^° of the W boson, reconstructed from the lepton and E™®®, is required to be greater 

^ Leptons are considered isolated if they are well separated from jets (AR > 0.4) and the track/calorimeter energy within a 
small cone, centred on the lepton direction but excluding the lepton itself, is below a fixed relative value. 

The transverse mass, mx, is defined as - cos(A0)), where A0 is the azimuthal angle between the lepton 

and the direction of the missing transverse momentum. 
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Figure 11; The impact of uncertainties on the jet-mass scale and resolution for 50% (a) and 90% (b) Z effi¬ 
ciency working points of the full boson-type tagger. The point marked HERWIG uses the alternative shower and 
hadronization model for the simulation, with the likelihood template from PYTHIA. 


than 60 GeV. Events must have at least one Z^-tagged jet (at the 70% efficiency working point) and 
have at least one large-radius trimmed jet with px > 200 GeV and \r]\ < 2. Furthermore, there must be 
a small-radius jet with px > 25 GeV, and A/? < 1.5 to the selected lepton (targeting the decay chain 
t bW{-^ €v)). The other W boson candidate is selected as the leading large-radius trimmed jet with 
^R >1.5 from the small-radius jet that is matched to the lepton. The IT-i-jets and multijet backgrounds 
are estimated from the data using the charge asymmetry and matrix methods, respectively [48]. The 
other backgrounds are estimated directly from MC simulation. Although the resulting event selection 
is expected to have a high tt purity (about 75%), the events cannot be compared directly to the isolated 
W bosons from the simulated W boson decays. This is because there are several effects that make 
the typical large-radius jet in semileptonic tt events different from isolated W and Z boson jets in W 
boson events": 

1. The event selection is based on the reconstructed jet px (earlier sections used p^), so even if 

let 

p]j, > 200 GeV for an 7? = 1.0 jet, the true hadronically decaying W boson in the event may 
have Pj < 200 GeV and thus the W boson decay products might not be collimated within 
AR < 1. 

2. There are more (close-by) jets in semileptonic tt events than in W boson events. Jets not 
originating from the W boson can form the leading large-radius jet, or the f?-jet from the same 
top-quark as the hadronically decaying W bosons can merge with the W boson decay products 
to form a large-radius jet. 

The variables pJ^Vand A7?(jet, W), for the W boson from the MC ‘truth’ record and the selected 
large-radius jet, are used to classify the various ti event sub-topologies. Events are labelled as having 


** When controlling for all differences, the distributions for isolated W bosons from tt and from W' are nearly identical. 
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a Boosted W if <0.1 and A/?(jet, W) <0.1. If the Z?-quark from the top-quark decay has 

an angular distance A/? <1.0 from the selected large-radius jet, this jet is labelled as ^-contaminated. 
All other tt events, including events where both W bosons decay into leptons, are labelled as Other. 
The pt spectrum of the jets from the classified events is shown in Fig. 12. In Fig. 12 and subsequent 
figures, sysfemafic uncerfainfies on fhe simulafion include fhe jef pj and jef mass uncerfainfies de¬ 
scribed in Seel. 6, buf exclude fracking uncerfainfies, which are sub-dominanf. Evenfs are vetoed if 
fhe selected large-radius jef has pT > 400 GeV or if fhe A/? befween fhe selecfed large-radius jef and 
a fagged Z?-jef is less fhan 1.0. This suppresses fhe f^-confaminafed tt evenfs. The effecfiveness of 
fhe tt evenf classificafion is mosf easily seen from fhe jef mass disfribufion, shown in Fig. 13(a). The 
mass of fhe boosfed W bosons from tt evenfs is peaked around my/, as is a small confribufion from 
fhe hadronically decaying W bosons in single-fop evenfs in fhe Wt channel. There is no peak af m, in 
fhe /7-contaminated specfrum because of fhe f^-jef vefo, buf fhere is a small non-resonanf confribufion 
below fhe fop-quark mass, due fo evenfs in which one W daughter is mafched wifh fhe f^-jef. This is 
akin fo fhe fj-jef-i-lepfon invarianf mass used in ofher circumsfances fo measure fop-quark properfies 
and nafurally has a scale around 150 GeV [49]. The low-mass peak in IT-tjefs and fhe ‘ofher’ tt evenfs 
is due fo fhe Sudakov peak from QCD jefs, fhe locafion of which scales wifh R x px. The dependence 
on pt of fhe IT-peak position in Fig. 13(a) is shown in Fig. 13(b). Evenfs wifh fhe leading jef in a 
window around fhe W mass, 50 GeV < < 120 GeV are selecfed and fhe median of fhe mass 

disfribufion is ploffed in Eig. 13(b) as a funclion of fhe jef px. The similar frend for fhe simulation and 
fhe dafa shows fhaf fhe combinafion of fhe reconsfrucfed jef-mass scale and ‘frufh’ jef-mass scale is 
well modelled. To quanlify fhe spread in fhe jef mass peak, various infer-quanfile ranges are shown as 
a function of px in Eig. 13(c). The infer-quanfile range of size 0% <X < 50% is defined as fhe differ¬ 
ence befween fhe 50% -i-A% quantile and fhe 50% - A'% quantile, and is a measure of fhe spread in fhe 
disfribufion. The widfh of fhe boosfed-VF mass peak is well modelled wifhin fhe slafisfical precision 
of fhe 2012 dafa sample. 

The modelling of boosfed W bosons can also be sfudied using fhe jef-mass scale measured from 
fracks. Defining fhe variable rtrack as fhe rafio of fhe jef mass defermined from fracks fo fhe jef mass 
defermined from fhe calorimefer, fhe jef mass scale uncerlainfy is relafed fo fhe difference from unify 
of fhe rafio of (rtrack) in dafa fo (rtrack) in MC simulafion. The mass scale uncertainly is calculaled 
using fhe procedure described above, buf wifh If fhe jef consisls only of pions, fhe nafural 

scale for r^ck is 2/3, alfhough fhere are significanf physics and defeclor effecfs fhaf infroduce a large 
spread of values. The disfribufion of r^ack in the rt-enriched event sample with the same px and 
/7-jet veto requirements as in Eig. 13 is shown in Eig. 14(a). Unlike the raw jet-mass distribution, 
the rtrack distribution is similar for all of the sub-processes, as expected. The scale and spread of the 
rtrack distribution are quantified in Eigures 14(b) and 14(c) using fhe px dependence of fhe median 
and infer-quanfile ranges. Previous sludies have indicated fhaf fhe Irack mulfiplicily, ntrack, in quark 
and gluon jefs is nol well modelled, especially for gluon jefs, where ntrack is lower in fhe dafa wifh 
respecf fo PYTHIA [50]. The disfribufion of fhe Irack mulfiplicily for large-/? jefs in fhe tf-enriched 
even! sample is shown in Eig. 15. The boosfed W evenfs are peaked al slighlly lower values of fhe 
number of associafed fracks compared fo fhe quark/gluon jefs from fhe ofher processes. The (charged) 
particle mulfiplicily increases for generic quark and gluon jefs as a funclion of jef energy. However, fhe 
mass-scale of fhe jefs produced from W boson decays is sel by my/ so fhaf in fhe absence of defeclor 
reconslruclion effecls, fhe Irack mulfiplicily disfribufion should be largely px independenl. The px 
dependence of fhe frack mulfiplicily is shown in Eig. 15(b) and 15(c) in fhe form of fhe median and 
fhe infer-quanfile ranges. The median does increase because of fhe large non-IT componenl as well as 
fhe finife detector accepfance for charged particles from fhe boosted W boson decay. The widfh is well 
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modelled within the statistical precision of the data. However, there is disagreement for the median. 
Previous studies (including Rev. [50]) suggest that this is due to fragmentation modelling and not the 
modelling of the detector response. 

The pT-weighted distribution of the track charges defines the jet charge, which is shown in Fig. 16(a). 
The charge of the lepton from the leptonic W boson decay determines the expected charge of the 
hadronically decaying W boson candidate, allowing for a tag-and-probe study of the capability of 
charge tagging in hadronic W boson decays [42]. The jet charge for boosted W bosons for positively 
(negatively) charged leptons is clearly shifted to the left (right) of zero. There is also some separation 
between positive and negative W boson decays when the selected large-radius jet does not satisfy the 
criteria for being a boosted W boson. This is because the jet still contains some of the W boson decay 
products, and the jet charge is correlated with the charge of the W boson. The difference between 
the inclusive and boosted IT-boson jets is clearer in the pr dependence plot of the median jet charge 
shown in Fig. 16(b). The medians of the distributions for boosted W jets are nearly twice as far 
apart as the medians for inclusive jets. However, in both cases the spread is less than the width of 
the distribution, shown as the inter-quantile range (inter-quantile range with X = 25%) in Fig. 16(c). 
Even though there is some small disagreement for the median number of tracks, the pT-weighted sum 
defining the jet charge is reasonably well modelled. 

The remaining input to the boson tagger is the (7-tagging discriminant for the matched small-radius 
jets. The efficiency-binned MVl distributions are shown in Fig. 17(a) and 17(b) with the same selec¬ 
tion criteria as for the previous figures, except that the (7-jet veto is removed. The contamination due 
to the (7-jet from the top-quark decay complicates a direct study of the MV 1 distribution for boosted W 
jets; contamination from the ( 7 -quark decay products is seen clearly in the MVl distribution at lower 
values of the efficiency. Most of the boosted W jets are in the highest efficiency bin because they have 
no real ( 7 -hadron decay. 

Overall, the simulation models all three input variables well. 


xIO^ 

^ ATLAS e-i-p 
o - \(s = 8 TeV, j L dt = 20.3 fb"' 



Figure 12: The px disffibution of the selected large-radius jets. The uncertainty band includes all the experi¬ 
mental uncertainties on the jet px and jet mass described in Sect. 6. 
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Figure 13: (a) The jet-mass distribution of the selected jets in semi-leptonic tt events, (b) The median of the 
mass distribution as a function of the jet pj for events with the selected jet in the range 50 GeV < < 

120 GeV. This includes the contributions from events which are not classified as Boosted W. (c) For the same 
events as in (b), the inter-quantile range as a measure of spread. The quantiles are centred at the median. The 
uncertainty band includes all the experimental uncertainties on the jet pj and jet mass described in Sect. 6. The 
inter-quantile range of size 0% < X < 50% is defined as the difference between the 50% -t X% quantile and the 
50% - X% quantile. Statistical uncertainty bars are included on the data points but are smaller than the markers 
in many bins. 
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Figure 14: (a) The distribution of rtrack in the data for semi-leptonic tt events with the selected jet in the range 
50 GeV < < 120 GeV. (b) The median of the rtrack distribution as a function of the jet pi. This includes the 

contributions from events that are not classified as Boosted W. (c) The inter-quantile range as a measure of the 
width. The quantiles are centred at the median. The uncertainty band includes all the experimental uncertainties 
on the jet pi and jet mass described in Sect. 6. The inter-quantile range of size 0% < X < 50% is defined as the 
difference between the 50% + X% quantile and the 50% - X% quantile. Statistical uncertainty bars are included 
on the data points but are smaller than the markers in many bins. 
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Figure 15; (a) The distribution of the number of tracks associated with the selected large-radius jet in the semi- 
leptonic ff data for events with the selected jet in the range 50 GeV < <120 GeV. (b) The median of the 

distribution of the number of tracks as a function of the jet pj. This includes the contributions from events 
that are not classified as Boosted W. (c) The inter-quantile range as a measure of the width. The quantiles are 
centred at the median. The uncertainty band includes all the experimental uncertainties on the jet pj and jet 
mass described in Sect. 6. The inter-quantile range of size 0% < X < 50% is defined as the difference between 
the 50% -I- X% quantile and the 50% - X% quantile. Statistical uncertainty bars are included on the data points 
but are smaller than the markers in many bins. 
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Figure 16: (a) The distribution of the jet charge in the data for semi-leptonic tt events with the selected jet in 
the range 50 GeV < < 120 GeV. The ratio uses the positive lepton charge, (b) The median of the jet charge 

distribution as a function of the jet p^. This includes the contributions from events that are not classified as 
Boosted W (except for the blue triangles, for which only the Boosted W is included), (c) The inter-quartile 
range as a measure of the width. The quantiles are centred at the median. The uncertainty band includes all the 
experimental uncertainties on the jet pj and jet mass described in Sect. 6. The inter-quantile range is defined as 
the difference between the 75% quantile and the 25% quantile. Statistical uncertainty bars are included on the 
data points but are smaller than the markers in many bins. 
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Figure 17; The efficiency-binned MVl distribution for the (a) leading and (b) sub-leading matched small-radius 
in semi-leptonic tt events. If there is no second small-radius jet with pj > 25 GeV and AR < 1 to the selected 
large-radius jet axis, the event is put in the ‘No jet’ category in (b). The uncertainty band includes all the 
experimental uncertainties on the jet pi and jet mass and those related to the ^-tagging described in Sect. 6. 
Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins. 
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8. Outlook 


The simulation studies of the boson-type tagger presented in Sect. 5 show that for bosons with 
200 GeV < Pi < 400 GeV, it is possible to achieve Z-boson efficiencies of ez - 90%, 50%, and 
10% with W' boson rejections of 1.7,8.3 and 1000, respectively. Putting this into context, with R{ez) 
defined as the lowest possible IT-boson tagging efficiency at a fixed Z-boson fagging efficiency: 

• The WZjWW cross-secfion ratio is ~ 20% [51]. Af fhe 50% fype-fagger working poinf, one can 
change fhe rafio of evenfs fo 


50% cr(lTZ) _ 50% cr(lTZ) 
/?(50%) ^ o-{WW) ~ 12% ^ o-{WW) 


U 


x20% 


83%, 


(4) 


wifh fhe possibilify for a high-purify exfraclion of fhe fTZ cross secfion in fhe semilepfonic 
channel {€vqq). 

• Diboson resonances are predicted by many models of physics beyond fhe Sfandard Model. The 
all-hadronic channel provides a significanlly higher yield fhan fhe lepfonic channels. Af fhe 
90% fype-fagger working poinf, one can disfinguish ZZ from WZ wifh a likelihood rafio of 
0.92/(0.9 X 0.6) ~ 1.5. 

• Af fhe 10% fype-fagger working poinf, a lepfophobic flavour-changing neufral currenf in tt 
producfion (wifh decays like in fhe SM) wifh a branching rafio of 1% would have fhe same 
number of evenfs as fhe t —> bW decay: 


10% r(t^Zc) 
/?(10%) ^ r{t Wb) 


10% r(t^Zc) 
0.1% ^ r(t ^ Wb) 


- 100 X 1% - 100%. 


(5) 


Only fhe range 200 GeV < pj < 400 GeV was sfudied fhus far due fo fhe availabilify of W bosons 
in fhe dafa. Figure 18 shows how fhe average and sfandard deviafion of fhe jef mass, jef charge and 
mulfiplicify of fhe mafched small-radius f^-lagged Jefs disfribufions depend on jef pj in simulation up 
fo 1 TeV. As long as fhe jef pt is high enough so fhaf a single jef capfures all of fhe boson decay 
producfs, fhe jef mass and jef charge disfribufions are predicfed fo be largely independenf of pj- The 
informafion from fj-fagging degrades around 400 GeV as fhe fwo decay producfs from fhe boson 
become too close fo resolve fwo separate jefs. 


9. Conclusions 

A fagger for disfinguishing hadronically decaying boosfed Z bosons from W bosons using fhe ATLAS 
defecfor has been presented. If will mosf likely be used after a boson fagger has rejecfed mosf QCD 
mulfijef evenfs. Three discriminafing variables are chosen which are sensifive to fhe differences in 
boson mass, charge, and branching rafios fo specific quark flavours: large-radius jef mass, large- 
radius jef charge, and an associafed small-radius jef f^-lagging discriminanf. For moderafe and high 

Up to impurities due to the high-occupancy tt environment. 
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Figure 18: The boson px dependence of the (a) jet mass, (b) jet charge, and (c) number of small-radius / 7 -tagged 
jets matched to the large-radius jet. 


Z-boson tagging efficiencies, the jet mass is the most discriminating of the three variables, but there is 
significant improvement in discrimination when combining all three inputs into a single tagger. At low 
Z-boson efficiencies, smaller than the Z ^ bb branching ratio, the Z^-tagging discriminant is the most 
useful for rejecting W bosons. The full tagger is largely unaffected by many systematic uncertainties 
on the inputs, with the exception of the uncertainties on the jet-mass scale and resolution. While it 
is not possible to measure the tagger efficiencies directly in data due to the lack of a pure sample 
of boosted, hadronically decaying Z bosons, modelling of the likelihood function using hadronically 
decaying W bosons has been studied in the data. Overall, the simulation agrees well with the 20.3 
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fb ^ of = 8 TeV pp data collected at the LHC. 
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A. Correlations with 2-subjettiness 


The tagger developed in this paper is designed to work in conjunction with a procedure for separating 
generic quark and gluon jets from boson jets. Figure 19 shows the joint distribution of the jet mass and 
jet charge with a standard boson tagging variable 2-subjettiness, T 2 i- The boson type tagger variables 
are nearly independent of T 21 . 
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Figure 19: The joint distribution of (a) jet mass and 2-subjettiness and (b) jet charge and 2-subjettiness for (left) 
W boson jets and (right) Z boson jets. 
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